Learning Optical Flow, Depth, and Scene Flow Without Real-World Labels

نویسندگان

چکیده

Self-supervised monocular depth estimation enables robots to learn 3D perception from raw video streams. This scalable approach leverages projective geometry and ego-motion via view synthesis, assuming the world is mostly static. Dynamic scenes, which are common in autonomous driving human-robot interaction, violate this assumption. Therefore, they require modeling dynamic objects explicitly, for instance estimating pixel-wise motion, i.e. scene flow. However, simultaneous self-supervised learning of flow ill-posed, as there infinitely many combinations that result same point. In letter we propose DRAFT, a new method capable jointly depth, optical flow, by combining synthetic data with geometric self-supervision. Building upon RAFT architecture, an intermediate task bootstrap triangulation. Our algorithm also temporal consistency losses across tasks improve multi-task learning. DRAFT architecture simultaneously establishes state art all three setting on standard KITTI benchmark.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Dynamic 3D Scene Depth Reconstruction via Optical Flow Field Rectification

In this paper, we propose a depth propagation scheme based on optical flow field rectification towards more accurate depth reconstruction. In depth reconstruction, the occlusions and low-textural regions easily result in optical flow field errors, which lead ambiguous depth value or holes without depth in the obtained depth map. In this work, a scheme is proposed to improve the precision of dep...

متن کامل

Scene Flow from Depth and Color Images

In this paper we consider the problem of estimating a 3D motion field using multiple cameras. In particular, we focus on the situation where a depth camera and one or more color cameras are available, a common situation with recent composite sensors such as the Kinect. In this case, geometric information from depth maps can be combined with intensity variations in color images in order to estim...

متن کامل

Learning Optical Flow from Real Robot Data

This project presents a method for teaching robot arms optical flow. By extending recent works on depth based, probabilistic robot tracking, a novel real-world optical flow dataset with dense ground truth annotation is generated. A real, robotic dataset boasts the inclusion of phenomena that synthetic datasets cannot model. With this data a generic convolutional neural network is implemented to...

متن کامل

Combining Stereo Disparity and Optical Flow for Basic Scene Flow

Scene flow is a description of real world motion in 3D that contains more information than optical flow. Because of its complexity there exists no applicable variant for real-time scene flow estimation in an automotive or commercial vehicle context that is sufficiently robust and accurate. Therefore, many applications estimate the 2D optical flow instead. In this paper, we examine the combinati...

متن کامل

Motion and Depth from Optical Flow

Passive navigation of mobile robots is one of the challenging goals of machine vision. This note demonstrates the use of optical flow, which encodes the visual information in a sequence of time varying images [1], for the recovery of motion and the understanding of the three dimensional structure of the viewed scene. By using a modified version of an algorithm, which has recently been proposed ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE robotics and automation letters

سال: 2022

ISSN: ['2377-3766']

DOI: https://doi.org/10.1109/lra.2022.3145057